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Abstract 

We consider an infinite-sized population where an infinite number of traits compete simultane¬ 
ously. The replicator equation with a diffusive term describes time evolution of the probability 
distribution over the traits due to selection and mutation on a mean-field level. We argue that 
this dynamics can be expressed as a variant of the Fisher equation with high-order correction 
terms. The equation has a traveling-wave solution, and the phase-space method shows how the 
wave shape depends on the correction. We compare this solution with empirical time-series data 
of given names in Quebec, treating it as a descriptive model for the observed patterns. Our model 
explains the reason that many names exhibit a similar pattern of the rise and fall as time goes by. 
At the same time, we have found that their dissimilarities are also statistically significant. 
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I. INTRODUCTION 


Sir Isaac Newton reportedly said that he could calculate the motion of heavenly bodies 
but not the madness of people, when he had lost a fortune in the South Sea Bubble. Since his 
time, there have been vigorous attempts to apply the so-called Newtonian approach to our 
society in order to understand the ‘madness of people’. It is interesting that some of modern 
economists have finally explained bubbles as based on rational expectation jl, 2], although it 
does not mean that bubbles are really under our control. At the same time, they distinguish 
a fad from a bubble as the former originates solely from social forces, which are harder 
to rationalize from an economic point of view. Even if we take a more phenomenological 
viewpoint, how this social influence organizes itself seems to remain largely unpredictable. 
A society adopts some practices and abandons some others constantly, but it is neither a 
matter of practical use nor that of aesthetic superiority. One might say that it is because 
something is ‘cool’, but it hardly explains anything but the unpredictability. It is another, 
perhaps a worse kind of ‘madness’ from a physicist’s point of view. 


Given names are subject to fads 


to the case of family 


1, 4 )]. It is sharply contrastec 

names, whose dynamics is well defined by mathematical models [[ 5 -8]. If a certain given 
name prevails, on the other hand, it is simply because it sounds cool, i.e., for no particular 
reason. Note that being a fad in this context does not necessarily mean that it is short¬ 
lived: For example, Michael was the most popular name in the United States for about half 
a century from the 1950s js]. Our point is that such popularity is not explained by any of its 
intrinsic properties, and we will call it a fad regardless of the time scale. Even if it is difficult 
to predict a fad, e.g., what will be the most popular name for babies next year, one may 
still expect a higher degree of regularity in the rise and fall of each given name. Fortunately, 
there is an available data set, compiled by Duchesne, which has recorded frequencies of 100 
major given names in the Canadian province of Quebec for more than a century 0 Q- 
It is basically a collection of names given to a certain number of people each year, from 
which the fractions of the given names are calculated. This collection is also called a corpus 
and its size, i.e., the number of people surveyed, has been greater than 2 x 10 3 every year 
throughout the 20th century. I 11 Fig. [0 the shade represents the fraction, and the names 
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FIG. 1. (a) Fractions of a hundred male given names and (b) those of the same number of female 

given names in Quebec, from year 1900 to 2000. The names are sorted in an ascending order of 
the mean time of prevalence [Eq. ([T])]. 


are sorted according to the mean time of prevalence, defined as 

E t tf(t) 


(t) = 


E t /(*r 


(i) 


where t runs from year 1900 to 2000, and f(t) is the fraction of a particular name in year t. 
This plot suggests the existence of a certain regular pattern. That is, fads propagate with 
forming a traveling wave across the space of names, from one name to another, and it is more 
clearly seen for female names. In addition, the narrower width of the name distribution along 
the vertical axis in Fig. [T^b) indicates that the change is more rapid for girls’ names. This 
observation implies that the female-name dynamics is probably simpler, in the sense that 
one only has to be concerned about how his or her daughter’s name will sound to others. 
Based on this observation, we will construct a descriptive model for the pattern in the 
female names. While the existing models in Refs, (d) and 4 consider the dynamics of a single 
name interacting with the environment, we are more interested in describing the competition 
among many names that takes place simultaneously. This many-body problem reduces to a 
reaction-diffusion system written in terms of the frequency distribution of newborn children’s 
given names, yielding a traveling-wave solution. By comparing the model with the empirical 
data, we will check whether our picture captures essential parts of the dynamics. 

This work is organized as follows. The next section presents our model, formulated as a 
generalized version of the Fisher equation [jjJ]. Section UTH then explains how we analyze the 
empirical data and compare them with our model. In this section, the goodness of fit will 
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be quantitatively estimated with a statistical test. After discussing the result in Sec. IIV1 we 
conclude this work. 


II. MODEL 


Our starting point is adoption-exploration dynamics of idea spreading [Idj]. According to 
this theory, we may imagine that each given name behaves as a biological species, competing 
to win as many adopters as possible. In other words, given names are assumed to obey 
certain evolutionary dynamics. To formulate this dynamics in mathematical terms, we need 
a number (or a set of numbers) to quantify the competitiveness of each name relative to 
that of others. Just as a species is said to have its own fitness [l4, 15], therefore, let us 
furthermore assume that it is possible to assign a certain value of ‘fitness’, represented by 
a real number x, to each given name. Our viewpoint is that the names are arranged and 
indexed by this fitness measure, and a name with x is picked up by a newborn baby’s parents 
from a certain probability distribution function. Note that our formulation mainly concerns 
how likely a name will be chosen for a newborn baby, rather than the fraction of all the 
people carrying that name. It is also important to remember that each given name has a 
fixed value of x, which is independent of time t, so that we can effectively map each x to 
a different name. For this reason, one can think of a; as a position in the space of names, 
analogous to the horizontal axis in Fig. [U although we do not actually have to determine 
the values for the data set in our analysis as will be seen below. So it is the fraction of x, 
instead of its value, that changes with time. Due to the growth of the fitter, the distribution 
will move to higher values of x in general. As a result, a time series for a specific given name 
can be unimodal at best. This might not be strictly true in Western countries [4]. But in 
Quebec, only a few percent of names succeed to come back. In our data set, therefore, such 
recurrence as considered in Ref. 4 can rather be treated as exceptional. 


Let us consider the following adoption-exploration dynamics [13j: 

dP d 2 P 

- = k(U-{U))P+D—, 


( 2 ) 


where P = P{x , t) is the probability density function at time t for finding a ‘species’ of fitness 
x G (—oo, Too), U = U(x,t ) is the payoff that the species gains, and (U) is the population 
average of U(x, t ) defined as (U) = f dxU(x, t)P(x, t). The first term on the right-hand side 
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(RHS) means that P(x,t) has a relative growth rate proportional to U(x,t), in a similar 


spirit to the replicator equation in evolutionary biology 16018]. One may also refer to 


Ref. [19 for mathematical aspects of a replicator equation on a continuous trait space. The 
second term on the RffS describes exploratory activities for new traits as diffusion along the 
x axis. The positive parameters k and D are thus called the rate of adoption and a measure 
of exploration, respectively. It is instructive to compare this with Eigen’s phenomenological 


theory of selection 


2d, 


21]. From our perspective, it is important that this theory of selection 


can generally be applied to any information carriers that reproduce themselves, including 
memes. The theory starts with the following rate equation for P t , the concentration of an 
information carrier % (i — 1,... ,n): 


d 


— FiPi — RiPi + ^^(0j jPj — 


(3) 


where the first and second terms refer to self-instructed reproduction and removal of the 
carrier, respectively, whereas the last summation contains all the other production such as 
mutation. As one of possible conditions for selection, we may choose JT P* = const. The 
reproduction and removal can be specified further as follows: Let us set F t = k^A^Qi, where 
ko is a rate constant, A* is an amplification factor, and Qi is a quality factor of precise 
reproduction. Likewise, we set Ri = k^Bi + 0 O , where P* is a decomposition factor and 
</>o is a dilution factor, which is controlled by the boundary condition and assumed to be 
independent of i. We now obtain a more detailed form of the rate equation: 

^ Pi = k 0 ( AiQi - Bi ) Pi + ^faPj ~ 4>jiPi) ~ 0o Pi- (4) 

j 

If we sum up both sides of Eq. fl3j) over i, we find that 

0 = ^ k 0 (AiQi ~ Bj ) Pj - 0o ^ Pi, (5) 

i i 

because 'Yhi'Yhji.&ijPj — <t>jiPi ) identically vanishes. If we define a selective value as Ui = 
AiQi — Bi, Eq. ([5]) means that (p 0 = k 0 J2iUiPi/^2iPi = k 0 (U). Equation flU) is thus 
rewritten as 


-^P, = k 0 (i Ui - (U)) Pi + ^(0^-P, - ^iPi). 


dt 


( 6 ) 


Supposing that the mutation rates faj are symmetric, homogeneous, and short-ranged in 
the index space, one can approximate the last term by a second-order differential and derive 
Eq. ([2]) through a suitable limiting process [21]. 
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It has been suggested in Ref. 13| that U(x, t ) can be identified with the cumulative distri¬ 


bution C(x,t) = f* P(x',t)dx' up to a proportionality coefficient absorbed into k, because 
those with x' < x are potential adopters of x. In the general context of fads, however, the 
payoff function might not be such a simple linear function of C(x,t). For example, there 
can be an adverse effect of being too popular. Let us generalize the above argument in the 
following way: We assume that there eventually develops a traveling wave with speed v and 
a unimodal shape of P(x — vt). At any time, the corresponding C(x) should be a monoton- 
ically increasing smooth function of x within a certain region. Our claim is that it can then 
be used as a basis for expressing an arbitrary shape of U(x) within the region of our interest. 
The idea is as follows: If we define g = C(x), there is one-to-one correspondence between x 
and fi as long as P > 0, and one can write U as a function of g instead of x. We expand U(g) 
as a polynomial in g, i.e., U = ao + a±g + aog 2 + • ■ • = ao + a\C(x) + d 2 C 2 (x ) + • • • with coef¬ 
ficients a, : . Clearly, a 0 does not alter the dynamics as (U) is subtracted, and cq is absorbed 
into the equation by rescaling the time scale. We will therefore retain the two lowest-order 
contributions as U(x,t) = C(x,t ) + § gC 2 (x,t) with a constant g, where the factor of § is 
added for later convenience. This constant g is introduced as a shape parameter to control 
the skewness of the distribution, but it can also be interpreted as the population’s preference 
to new fads. If g is negative, for example, it means that the population is ‘conservative’ in 
the sense that the fitness U does not increase with x as sharply as it would under positive 
g. Noting that 

C n Pdx = [ C n dC = —, (7) 

JO 71+1 



we rewrite Eq. (EJ) as 


dP 

dt 


c + laC 2 -l(l + g) 


P + D 


d 2 P 
dx 2 ' 


( 8 ) 


By rescaling t and x as kt and x\JkjD , respectively, we reduce the equation to 


dP 

~dt 


C + 2 sC 2 - -(1 + g) 


P + 


d 2 P 
dx 2 ’ 


By using = P, this can be written as 


d 2 C _ 8 
dtdx dx 


C 2 9^ 3 ^ d 2 C 

T + f c -2 (1 + 9)c+ a? 


which implies that 


dC 1 d 2 C 

— = -C{C-\){ J gC + g + l) + -^ + <T{t), 


(9) 


( 10 ) 


( 11 ) 
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where a(t) is a function of t only. Substituting x = ±oo, we find that a{t) = 0 because the 
left-hand side and the first two terms in the RHS identically vanish there. To sum up, we 
will consider the following equation: 


^-lc(C-l)(gC + g + l) + ^. 


( 12 ) 


This form is close to the Zeldovich equation 22], although the phase-space structure is 


different as will be detailed below. If g — 0, this is mathematically equivalent to the 


celebrated Fisher equation 


inhibitor model in Ref. 




This model can also be interpreted in terms of the activator- 


31 (see Appendix [A] for details). In addition, one may recall that a spin 
system is phenomenologically expressed by the time-dependent Ginzburg-Landau equation 
as 

dm 

= m — m 3 + V 2 m, (13) 

where m denotes the order parameter { 23 1. One keeps the linear and cubic terms of m to 
describe the symmetric double-well structure of the free-energy density functional. I 11 this 
respect, the additional term proportional to g in deriving Eq. (TT2l) can be interpreted as 
introducing a cubic term to the RHS, although we are not dealing with a magnetic system. 

The assumption of the traveling wave implies that C(x,t) = C (x — vt), where v is the 
speed. In a moving frame with coordinate ij = x — vt, Eq. (TT2|) describes the shape of the 
wave as a second-order ordinary differential equation: 


dC 1 ^ w „ . d 2 C 

~ l % ~ 2° {C ~ 1)( ® C + S + 1)+ Af’ 

or, equivalently, a set of two first-order ordinary differential equations: 


(14) 


I dC/dr] = P 
| dP/drj = —vP 


\C{C — l)(gC + g + 1), 


(15) 


where the first line relates the probability density function P and its cumulative C. This 
system has three fixed points at ( C,P ) = (0,0), (1,0), and (—1 — (R^O). The linear 
stability analysis shows that the eigensystem around the origin consists of eigenvalues A± = 


± ^v 2 + 2 + 2 g 


and eigenvectors 


y± = 



(16) 
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FIG. 2. (Color online) P versus C , where P is scaled by its maximum amplitude P ma y . The speed 
is chosen as v = 2. Note that the flow of Eq. m goes from (C, P ) = (0,0) to (1,0) as rj = x — vt 
increases, (a) The solid line with g = +0.4 has ^ > 0 at C = ^ in this plot, and P(rj) has negative 
skewness. If we fix x and observe its time series as t varies, therefore, the height grows rapidly and 
then decreases slowly, (b) The dotted line with g = —0.4 shows the opposite case, as explained in 
the main text. 


On the other hand, the eigensystem in the vicinity of ( C,P ) = (1,0) is obtained as fi± = 
v ± y/v 2 — 4g — 2 1 , together with the corresponding eigenvectors: 


z± = 



(17) 


If v < yfig + 2, the eigenvalues become complex, which is not physically reasonable because 
C would not be bounded within the unit interval. This proves the existence of the minimum 
speed v m i n = \/4g + 2 below which one cannot And a traveling-wave solution (see Appendix iBl 
for more discussion and references on the speed selection principle). If <7 = —1, the wave 
can stop, meaning that the population is so conservative that there might be no progression 
to higher x. The last fixed point at (C,P) = (—1 — g _ 1 ,0) has an eigensystem of = 


-v ± y/v 2 - 4g 2 - 6 - 2 g- 1 


and 


W-t- = 



(18) 


The last fixed point should not lie between the other two, because it is meant to modify the 
flow from (0,0) to (1,0), leaving the overall phase-space structure unaltered. This restricts 
the possible range of g to g > — With this parameter g, one can control the skewness of 
the wave shape: Consider a trajectory from (0,0) to (1,0) in the ( C,P ) plane, as described 
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FIG. 3. (Color online) (a) More than two-thirds of the names show left-skewed P(g ), and one such 
example is plotted here. It means that they quickly become popular and then slowly disappear, 
because fj oc —t. The noisy curve represents the corpus data, and the smooth one is obtained from 
our model. This example is fitted with v = 1.7 and g = —0.06 (see the main text for details of the 
fitting), (b) A dozen of names are equally well or better fitted to right-skewed distribution. The 
fitting parameters are v = 2.4 and g = —0.43 in this example, (c) Recurrence is observed for a 
couple of names like Gabrielle and Marie-Anne, roughly with a centennial period, (d) There are a 
few ‘steady sellers’, including Rachel and Elisabeth, with no particular patterns. 


by Eq. (1T5|) . If the shape of P(rj) is negatively skewed, we will see ^ > 0 at C — |, and 
vice versa (Fig. EJ). Setting C = we see from Eq. (TT5T) that 


dP 1 + 3o/2 

- — —V ~\~ - 

dC 8 P* 


(19) 


where P* is the value of P when C = \. Equation (HU]1 shows that one can change the sign 
of hence the skewness, by modulating g. It is not a linear relationship, though, because 
P* also depends on v and g. A negative value of g close to its lower bound — i will lead to 
a positively skewed shape of P[rj) (see Fig. [2]). As this wave travels in time, we will see that 
fads are adopted slowly and then abandoned rapidly, compared with the case of positive g. 
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III. DATA ANALYSIS 


For given v and g, we can obtain P(rj) = P(x — vt) by numerically integrating Eq. (fT5]h 
It is not directly observable, however, because it is difficult to measure x in practice. The 
available data set is given as a collection of time series, one for each different given name. If 
we choose one of them, x is fixed by assumption, and the observed time series corresponds 
to P(t — x/v). That is, if compared with the hypothetical distribution P(rj ), the time series 
would be a reflected and scaled image along the horizontal axis. Let us stress the underlying 
assumptions behind this statement: First, the parameters such as v and g are assumed to 
be stationary, at least approximately, to relate the observed time series to P{rj). Second, 
our model assumes that every name is equal, except for its value x. Therefore, in principle, 
every name is expected to exhibit a similar pattern in its time series. Recall that each time 
series should be related to others by time translation, because it appears as P(t — x/v). For 
each time series to be regarded as a projected image of a probability distribution function, 
therefore, our data should be traced backward with normalizing the area under the curve to 
one. In other words, the time series /(f) up to f = t max transforms to /(f max — t)/ /(f), 

which we denote as P(fj) with fj = f max — f. 


Every year from 1900 to 2000, we randomly sample 2 x 10 3 people out of the corpus and 
will work with this sampled data henceforth. Otherwise, it would be difficult to compare the 
number of people given a particular name, counted over the century, with that of another 
name, because the size of the corpus varies year by year. The resulting P(fj) for the Quebec 
female names shows four representative types: The first is negatively skewed as depicted in 
Fig. E3a), and this comprises the major part of the hundred names. We have found negative 
skewness for about 60 names, and the actual number could well be greater than this, because 
very old or new names show only one of the tails in their time series so that their skewness 
cannot be determined precisely. There are less than ten names that are better described by 
right-skewed distribution [Fig. [21(b)]. Only a couple of names exhibit such recurrent behavior 
[Fig. [31(c)] as has been considered in Ref. 4. Note that the recurrence is entirely ignored in 
our model. Finally, we find three names that are noisy around their respective mean values, 
and thus not covered by any of the categories above [Fig. [21(d)]. 


The sampled data points are fitted to our model curve, integrated from Eq. (fT5jh The 
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objective is to minimize the following cost function: 


X 


2 


£ 


(rij - Npi) 2 
Npi 


( 20 ) 


where n* is the number of people given a particular name at year i, N is the total number of 
people given this name over the period from 1900 to 2000, and pi is the predicted fraction 
from our model. We interpret this problem as distributing N people into 101 bins from 1900 
to 2000. We should have four fitting parameters, two of which are v and g , and the other 
two determine translation and scaling along the horizontal axis. We have added one more 
for a vertical shift, because the y 2 statistic [Eq. fl20]) ] is too sensitive to background noise: 
Our model describes how a fad of a name catches on and fades away over a certain period, 
so it has pi ~ 0 outside that period. In practice, however, the name can still be found as a 
stochastic effect. If pi is vanishingly small there, the cost function y 2 diverges. If the noisy 
fluctuations are not taken into account, therefore, the fitting program will try to avoid this 
divergence at every cost, even if it misses the main signal. The number of degrees of freedom 
thus amounts to d/ = 101 — 1 — 5 = 95. The rule of thumb is that y 2 should be of O(df). 
The conventional criteria for statistical significance allow y 2 to be roughly as high as 130, 
which is observed with probability 0.01 in the y 2 distribution. 

The parameters are adjusted by the Metropolis algorithm, in which Eq. (1201) plays the 
role of the energy function. The energy landscape will be simple enough to apply the zero- 
temperature Monte Carlo method, as long as the initial condition is reasonably chosen. To 
estimate the possible ranges of the fitting parameters, however, we choose a hnite tempera¬ 
ture T = 1 because fluctuations of 0(1) in y 2 will hardly affect the statistical significance. 
The initial parameters for translation and scaling can be estimated by checking the statistical 
moments of the data, and we try two initial values of g = 0 and g = 0.4, with v = tw n (g), 
for faster convergence. The resulting distribution of y 2 for the hundred female names is 
plotted in Fig. [4](a). The figure shows that about 40 percents of the hundred female names 
are successfully described by our model, yielding y 2 < 130. For these names, we also plot 
the ranges of their fitting parameters in Fig. IU(b). Here, one immediately finds a cluster 
of names with g cs 0 and v < 5, which is separately depicted in the inset. We see from 
this figure that many of the parameter values do overlap as predicted by our theory. It is 
also interesting that the estimated speed is not far from u min , if we consider that a new fad 
would start with a sharp initial condition. Figure [U(c) shows the estimated parameters of 
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FIG. 4. (Color online) Monte Carlo fitting results with the Metropolis algorithm. The energy 
function is identified with y 2 and the temperature is set to be T = 1. (a) The distribution of y 2 
[Eq. (1201) ] for the hundred female names after the fitting. The vertical line means y 2 = 130, below 
which the statistical significance is greater than 0.01. (b) Distribution of v and g for the names 
fitted with y 2 < 130. The error bars express the ranges of the parameters during 10 5 Monte Carlo 
steps. Inset: A zoomed view. The dotted lines represent u m ; n = ^/4 g + 2. (c) Estimated v and g 
against the mean time of prevalence [Eq. ([!])] for the names in the inset of (b). (d) Actual data 
and fitting curves for two outliers on the (g,v) plane, indicated by the arrows in (b). 


the names inside the cluster, against their (t) [Eq. (JT])]. It confirms that the parameters 
have been stationary throughout the period of observation, which is one of our underlying 
assumptions. 

We also have a few outliers that accept fairly large v or g. Let us take a closer look by 
plotting the data for two of them, Genevieve and Brigitte [Fig. ID[d)]. Then, we immediately 
find that the large v of the former name is likely to be an artifact due to the steady noise level, 
because a broader distribution may be interpreted as faster propagation (see Appendix [B]). 
The same explanation applies to all the other names with large error bars in Fig. |4](b). On 
the other hand, the latter case of Brigitte shows exceptionally large skewness in spite of 
rather stable parameter estimations [Fig. 01(b)] . Such a distorted shape is probably caused 
by external factors that our model cannot account for. 

Overall, we conclude that our model does explain a part of the observed data, because it 
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describes about 40 names with consistent values of v and g. At the same time, the degree 
of heterogeneity among the given names has turned out to be higher than we expected from 
Fig. m 


IV. DISCUSSION AND SUMMARY 

In summary, we have constructed a model to describe a pattern of fads. Our working 
hypothesis has been that although each fad is unpredictable, the overall pattern tends to 
repeat itself over and over again. Figured] provides a striking indication of this hypothesis, 
and suggests that the governing dynamics can develop a traveling-wave solution with a well- 
defined shape. We have built up a mean-held theory of selection and mutation to explain 
the observation in Fig. |T] The assumption of a traveling-wave solution implies that the 
time series of every name can be fitted to a single curve if translated horizontally. We have 
introduced a shape parameter g in addition to the wave speed v, and checked whether these 
parameters give consistent estimates over the hundred time series. Our model must have 
oversimplified the reality as we all know that the naming dynamics is involved with many 
accidental factors such as celebrities and mass media. So as long as the theory predicts 
sensible behavior, the question should be how much these random fluctuations alter the 
dynamics. Our analysis suggests the existence of the regularity on a qualitative level: Most 
of the names are described by unimodal left-skewed distribution, which means that it is 
common to adopt a fad rapidly and then abandon it slowly. Quantitatively, we have found 
meaningful fitting results for about 40 names among 100. Since we have missed more than 
a half, we also conclude that the heterogeneity is not so negligible as assumed in the theory. 
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Appendix A: Activator-inhibitor model 


Reference 3 suggests an alternative model to explain the empirical pattern of given names 
as follows: Consider a particular name and let P(t) and Q(t) be the fraction of couples that 
give the name to their babies and the fraction of people that bear the name at time t, 
respectively. The former fraction of couples ‘activate’ the use of this name, whereas the 
latter fraction of people tend to ‘inhibit’ it. If K,(t) means the mortality rate at t, the change 
of Q(t) over a time scale r can be written as 


j t Q(t) = t 1 P(t ) - K(t)Q(t), 


with a formal solution: 

Q(t) = r-'e- 1 ® 


P^e^dt' + Q{ — oo)e I ®, 


(Al) 


(A2) 


where I(t) = /*_ n(t')dt'. The second term on the RHS can be discarded by setting the 
surface term at t — —oo as zero. On the other hand, the governing equation for P(t) is 
assumed to be 


d 

dt 


P{t ) = -aP\t) + f3 


1 

1 

T—i 

1- 

^ Q(t) 1 

Ps J 

Qs J 


m, 


(A3) 


where a, (3, l, P s , and Q s are model parameters. The first term on the RHS describes a 
threshold effect, and P s and Q s mean saturation levels for P and Q, respectively. 

In this appendix, we compare this activator-inhibitor model with our approach. Let us 
first assume that the threshold phenomena are negligible and that P <C P s all the time. 
Then, the equation simplifies to 




Q{t) 

Qs 


P(t). 


(A4) 


If the mortality rate is small enough within the time scale of observation, we can approximate 
Eq. (jA2j) as 

Q(t) ~ t~ 1 f P{t')dt'. (A5) 

J — OO 

Our model considers a traveling wave of velocity v > 0 represented by P(x,t) = P(x — vt). 
It implies that the following integral over time 

pt px—vt 

/ P(x — vt')dt' = —u _1 / P(x — vt')d{x — vt') (A6) 
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is related to the cumulative distribution 


rx—vt 


C(x — vt)= / P(x' — vt)dx' = 


P(x' — vt)d(x' — vt). 


(AT) 


Roughly speaking, therefore, it is 1 — C(x, t) that plays the role of inhibition from those who 
bear the name in our model. Plugging this into Eq. (I A 5(1 . we see that the increase of P is 
proportional to C, which can be written as 

dP 


dt 


— k(C — (C)) P, 


(AS) 


where k is a proportionality constant, and we need (C) = C(x',t)P(x',t)dx' in the 

parentheses to guarantee the conservation of total probability j°° P{x',t)dx' = 1. The 
constant k can be absorbed into t by rescaling the time scale. Reference Ell shows that 
Eq. (1A8I) has a solution 


C(x, t) = - tanh 


h(x) — 


+ 


(A9) 


with a certain function h(x) such that dh/dx > 0, h(x —> +oo) = +oo, and h{x —* —oo) = 
—oo). If this is a traveling wave with a constant velocity, h(x ) should be a linear function 
and P(x, t) thus has a peak with zero skewness. Reference (Ell has found that we can make 


it skewed just by adding a diffusive term to the ‘reaction’ between activators and inhibitors 
that Eq. (1A8|) describes. The present work has generalized this finding by introducing a 
shape parameter called g to control the skewness. It is straightforward to see that Eq. (1A8j) 
is obtained from Eq. OS} by taking g — 0, k — 1, and D — 0. The activator-inhibitor model 
in Ref. 3 differs from our reaction-diffusion approach in that it mainly focuses on elaborating 
the reaction kinetics, e.g., by including threshold and saturation effects for P. In contrast to 
such single-name dynamics, the diffusive process introduces other names to the population, 
playing a similar role to mutation in biology. 


Appendix B: Speed selection principle 


How fast the traveling wave propagates into a linearly unstable state is one of main 


concerns when we study a reaction-diffusion system. This is usua’ 
propagation problem. In the context of the original Fisher equation 


ly known as the front 


12], the traveling wave 


describes expansion of a population, whose density is denoted as u(r, t ), where r means a one¬ 
dimensional real space. Under an assumption that u spreads with keeping a constant shape 
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at an asymptotic speed w as in the large-f limit, u(r,t ) can be approximated as u{r — v as t). 
The speed v as of the nonlinear wave is governed by its initial profile at t — 0 and the leading 


edge 


24 


28]. Let us consider a monotonically decreasing wave profile, such as u ~ e 


—C r 


It is known that there is a critical value ( = (*, above which the wave propagates with 
the lowest speed v*. Therefore, if the initial profile is steeper than e~+ r , the wave moves 
forward at the asymptotic speed v as equal to v*. Otherwise, it travels at v as larger than 
v*, and v as is controlled by the dynamics of the edge which is obtained by linearizing the 
original equation. 

In our notation, 1 — C maps to the population density u in the Fisher equation. As 
far as a well-mixed population is concerned, the spatial dependency is negligible and we 
have u(r,x,t ) = u(x,t). For finding the minimum propagation speed v*, let us substitute 
u = 1 — C into Eq. ([TUP to obtain 


du 1 . . . , d 2 u 

- = -u(l-u)(l + 2 g -2u) + ^. 


(Bl) 


Equation (IBID is linearized at the edge (uCl), which results i 


m 


du 1 . . d 2 u 

m - 2 u(1 + 2j) + a?' 


(B2) 


Suppose that the wave profile has an exponential tail in the x space, i.e., u(x, t ) = u(x—vt ) 
e -C (x-vt)' j£ we pj U g ansa £ z i n t 0 Eq. (1B2j) . it yields a quadratic equation for £: 


C 2 - + -(1 + 2g) =0. 


(B3) 


Solving this equation, one obtains a dispersion relation for the edge speed as a function of 
( as follows: 

« ( C) = C+L( 1 + 2 9 ). (B4) 

This expression diverges as C —•t 0 or £ —> oo and always has a minimum at (* between the 
two regimes. The derivative of Eq. (1B4I) with respect to ( vanishes at such (*: 


which has two extrema at ( 
minimum speed: 


dv(Q 

d( 


1 - 


1 + 2 g 

C 2 


0, 


(B5) 


±a/ (1 + 2g)/2. The one with the positive sign gives the 


u*= v/2(l + 2 g). 


(B6) 
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Then, (* can be expressed self-consistently in terms of v*, i.e., (* = v*/2. 
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